High Dimensional Feature Selection via a Slow Intelligence Approach
نویسندگان
چکیده
Feature selection in high dimensional space is the hot topic in contemporary machine learning area. In the past decade, a lot of effort has been devoted in developing various feature selection algorithms. However, each feature selection method has its own advantage for different datasets or applications. Therefore, one might face the difficulty of choosing the most suitable feature selection method in practice. Reference [1] proposed a new framework called Slow Intelligence System, which can continuously learn, search the appropriate method and propagate information according to the environment to improve the performance over time. In this paper, we adopt this slow intelligence idea in the application of high dimensional feature selection. We propose a new framework which could dynamically choose feature selection algorithm from a pool of methods according to the status quo. We apply our method to cancer datasets with the expression levels of thousands of genes as features. The experimental result shows that our method is superior to individual state-of-the-art feature selection methods. Keywords-Feature selection, machine learning, slow intelligence system
منابع مشابه
Feature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach
Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...
متن کاملA hybrid filter-based feature selection method via hesitant fuzzy and rough sets concepts
High dimensional microarray datasets are difficult to classify since they have many features with small number ofinstances and imbalanced distribution of classes. This paper proposes a filter-based feature selection method to improvethe classification performance of microarray datasets by selecting the significant features. Combining the concepts ofrough sets, weighted rough set, fuzzy rough se...
متن کاملA Monte Carlo-Based Search Strategy for Dimensionality Reduction in Performance Tuning Parameters
Redundant and irrelevant features in high dimensional data increase the complexity in underlying mathematical models. It is necessary to conduct pre-processing steps that search for the most relevant features in order to reduce the dimensionality of the data. This study made use of a meta-heuristic search approach which uses lightweight random simulations to balance between the exploitation of ...
متن کاملEnsemble Classification and Extended Feature Selection for Credit Card Fraud Detection
Due to the rise of technology, the possibility of fraud in different areas such as banking has been increased. Credit card fraud is a crucial problem in banking and its danger is over increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...
متن کاملIntrusion Detection based on a Novel Hybrid Learning Approach
Information security and Intrusion Detection System (IDS) plays a critical role in the Internet. IDS is an essential tool for detecting different kinds of attacks in a network and maintaining data integrity, confidentiality and system availability against possible threats. In this paper, a hybrid approach towards achieving high performance is proposed. In fact, the important goal of this paper ...
متن کامل